A Large-Scale CNN Ensemble for Medication Safety Analysis

Author: A Sarker
H Gurulingappa
WYS Chou
Publication date: 17/06/2017

Revealing Adverse Drug Reactions (ADR) is an essential part of post-marketing drug surveillance, and data from health-related forums and medical communities can be of great significance for estimating such effects. In this paper, we propose an end-to-end CNN-based method for predicting drug safety from user comments on healthcare discussion forums. We present an architecture based on a large ensemble of CNNs with varied structural parameters, where the prediction is determined by majority vote. To evaluate the performance of the proposed solution, we present a large-scale dataset collected from a medical website that consists of over 50 thousand reviews for more than 4000 drugs. The results demonstrate that our model significantly outperforms conventional approaches and predicts medicine safety with an accuracy of 87.17% for binary and 62.88% for multi-class classification tasks.
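
The majority-vote step mentioned in the abstract can be illustrated with a minimal sketch. It assumes each ensemble member has already produced one class label per review; the function names `majority_vote` and `ensemble_predict` are hypothetical, and the CNN members, their parameters, and any tie-breaking rule used in the paper are not described in this record.

```python
from collections import Counter


def majority_vote(votes):
    """Return the label that most ensemble members voted for.

    Ties go to the label seen first (an assumption made here;
    the abstract does not specify a tie-breaking rule).
    """
    if not votes:
        raise ValueError("at least one vote is required")
    return Counter(votes).most_common(1)[0][0]


def ensemble_predict(per_model_labels):
    """Combine predictions from several models, sample by sample.

    per_model_labels: one list of labels per model, all of equal
    length and aligned so that position i refers to the same review.
    """
    return [majority_vote(list(votes)) for votes in zip(*per_model_labels)]


if __name__ == "__main__":
    # Three hypothetical models, three reviews, binary labels.
    model_outputs = [
        [1, 0, 0],
        [1, 0, 1],
        [0, 0, 1],
    ]
    print(ensemble_predict(model_outputs))
```

With an odd number of members and binary labels, ties cannot occur; for the multi-class task a tie-breaking choice would be needed, and the one used here is only a placeholder.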

arXiv.org e-Print Archive

Crossref

SAQC: SNP Array Quality Control

Author: A Collins
Affymetrix Inc
AJ Sharp
CC Laurie
Chien-Wei Lin
Chun-Houh Chen
DB Goldstein
DE Reich
DF Conrad
E Gunduz
E Pomares
FA Monzon
FJ Massey
FJ Steemers
G Zhao
GC Kennedy
H Matsuzaki
H Ogiwara
H Primdahl
H-C Yang
HC Yang
HM Blauw
Hsin-Chi Lin
Hsin-Chou Yang
J Huang
J Sebat
Jer-Yuarn Wu
JM Korn
JN Hirschhorn
JP Hua
K Kurashina
KK Wong
KL Gunderson
L Feuk
L Kruglyak
Ling-Hui Li
LR Cardon
M Puputti
M Stark
MA Abdulla
MC Campbell
Meijyh Kang
N Arnheim
N Rabbee
R Huggins
R Lessig
R Pomeroy
S Ishikawa
The International HapMap Consortium
The Wellcome Trust Case Control Consortium
Wen-Harn Pan
WH Pan
WYS Wang
XF Zhou
XJ Di
Yuan-Tsong Chen
Publication venue: BioMed Central
Publication date: 01/04/2011

Background: Genome-wide single-nucleotide polymorphism (SNP) arrays containing hundreds of thousands of SNPs from the human genome have proven useful for studying important human genome questions. Data quality of SNP arrays plays a key role in the accuracy and precision of downstream data analyses. However, good indices for assessing data quality of SNP arrays have not yet been developed.
Results: We developed new quality indices to measure the quality of SNP arrays and/or DNA samples and investigated their statistical properties. The indices quantify the departure of estimated individual-level allele frequencies (AFs) from expected frequencies via standardized distances. The proposed quality indices followed lognormal distributions in several large genomic studies that we empirically evaluated. AF reference data and quality index reference data for different SNP array platforms were established based on samples from various reference populations. Furthermore, a confidence interval method based on the underlying empirical distributions of quality indices was developed to identify poor-quality SNP arrays and/or DNA samples. Analyses of authentic biological data and simulated data show that this new method is sensitive and specific for the detection of poor-quality SNP arrays and/or DNA samples.
Conclusions: This study introduces new quality indices, establishes references for AFs and quality indices, and develops a detection method for poor-quality SNP arrays and/or DNA samples. We have developed a new computer program, SNP Array Quality Control (SAQC), that implements these methods. SAQC software is written in R and R-GUI and was developed as a user-friendly tool for the visualization and evaluation of data quality of genome-wide SNP arrays. The program is available online (http://www.stat.sinica.edu.tw/hsinchou/genetics/quality/SAQC.htm).
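
As a rough illustration of how a lognormal reference distribution can yield a cutoff for poor-quality samples, the sketch below fits a normal distribution to the logarithms of reference quality-index values and flags new values above a one-sided upper bound. This is not SAQC (which is written in R) and not the paper's exact procedure: the one-sided 95% level, the assumption that larger index values mean poorer quality, and the function names `lognormal_upper_bound` and `flag_poor_quality` are all assumptions made for this example.

```python
import math
from statistics import NormalDist, mean, stdev


def lognormal_upper_bound(reference_values, level=0.95):
    """Upper bound of a one-sided interval under a lognormal model.

    reference_values: positive quality-index values from reference
    samples (hypothetical input; at least two values are needed).
    """
    logs = [math.log(v) for v in reference_values]
    mu, sigma = mean(logs), stdev(logs)   # parameters on the log scale
    z = NormalDist().inv_cdf(level)       # standard normal quantile
    return math.exp(mu + z * sigma)       # back-transform to the index scale


def flag_poor_quality(values, upper_bound):
    """Mark each value that exceeds the reference upper bound."""
    return [v > upper_bound for v in values]


if __name__ == "__main__":
    # Made-up reference indices, for illustration only.
    reference = [0.8, 1.0, 1.1, 0.9, 1.2, 1.0, 0.95]
    bound = lognormal_upper_bound(reference)
    print(bound, flag_poor_quality([1.0, 3.5], bound))
```

Working on the log scale keeps the bound positive and reflects the right-skewed shape that a lognormal index would have.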

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central